AMENDMENTS 



In the claims: 

Please amend Claim 1 1 to read as follows. 

1. (previously presented) An isolated nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO: 1. 

2. (original) An isolated nucleic acid molecule comprising a nucleotide sequence 

that: 

(a) encodes the amino acid sequence shown in SEQ ID NO: 2; and 

(b) hybridizes under stringent conditions to the nucleotide sequence of 
SEQ ID NO: 1 or the complement thereof. 

3. (original) An isolated nucleic acid molecule comprising a nucleotide sequence 
that encodes the amino acid sequence shown in SEQ ID NO: 2. 

4. (original) An isolated nucleic acid molecule comprising a nucleotide sequence 
that encodes the amino acid sequence shown in SEQ ID NO:4. 

5. -10. (cancelled) 

1 1 .(currently amended) An expression vector comprising [a] the nucleic acid 
sequence of Claim 4. 

12.(previously presented) A cell comprising the expression vector of Claim 11. 
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RESPONSE 



I. Status of the Claims 

Claim 11 has been amended as suggested by the Examiner. Claims 1-4, 1 1 and 12 are 
pending . 

II. Support for the Amended Claims 

Amended Claim 11 finds support in original Claim 11 which found support in original Claim 
4, throughout the specification as originally filed with particular support being found at least on page 
13, lines 25-32. 

As the amendments to Claim 1 1 are fully supported by the specification and claims as 
originally filed, they do not constitute new matter. Entry therefore is respectfully requested. 

IIL Rejection of Claims Un der 35 U.S.C. § 112. Second Parap raph 

Clam 11 stands rejected under 35 U.S.C. § 112, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as the 
invention. Although Applicants in no way agree, in order to further progress this application towards 
allowance, Claim 11 has been amended exactly as the Examiner requested and thus this rejection 
has been avoided and Applicants' therefore request its withdrawal. 

IV. Rejection of Claims Under 35 U.S.C. S 101 

The rejection of claims 1-4, 11-12 under 35 U.S.C. § 101 is maintained because the claimed 
invention allegedly is not supported by either a specific and substantial asserted utility or a well- 



established utility. This rejection is respectfully traversed, based on the following arguments as well as 
those presented in earlier responses. 

The rejection of claims 1-4 is maintained in this Final Action which again asserts that Applicants 
have failed to identify the function of the protein encoded by the sequences of the present invention and 
that therefore there can be no specific, substantial or credible utility. 
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The Final Action dismisses Applicants' continued assertions that the protein of the present 
invention is a human semaphorin protein and that semaphorin protein function is both well known and 
im P lied to mose of sk ill in the art. The Action at page 3, line 5, cites Bork (Genome Research 70:398- 
400, 2000) as supporting the proposition that prediction of protein function from homology information 
is somewhat unpredictable. However, a careful reading of Bork' s publications and the other "relevant 
literature" does not in fact support the concept that function cannot be based on sequence and structural 
similarity, in contrast many of the examples actually support the use of such methodologies while 
identifying several areas in which caution should be exercised. These inaccuracies and potential pitfalls 
can be overcome by a more careful analysis by those of skill in the art. Automatic methods of sequence 
homology identification was only the staring point for consideration the sequences of the present 
invention underwent careful analysis by a series of individuals of skill in the art, many highly qualified 
(B.S. and Ph.D. level scientists). 

These articles are merely examples of a small number of spurious publications that call into 
doubt the usefulness of bioinformatic predictions and that the PTO has repeatedly attempted to use as 
a basis to deny the utility of nucleic acid sequences. However, without going into the merits (or lack 
thereof) of all of the cited articles, Appellants point out that the lack of 100% unanimous agreement on 
the usefulness of bioinformatic prediction programs is completely irrelevant to the question of whether 
the claimed nucleic acid sequence has a substantial and specific utility. Appellants respectfully point out 
that the legal test for utility simply involves an assessment of whether those skilled in the art would find 
any of the utilities described for the invention to be believable . Appellants submit that the 
overwhelming majority of those of skill in the relevant art would belfeye bioinformatic prediction to be 
a powerful and useful tool, as evidenced by hundreds if not thousands of journal articles. 

Rather, the question of uti lity is a straightforward one. As set forth by the Federal Circuit, ' '(t)he 
threshold of utility is not high: An invention is 'useful' under section 101 if it is capable of providing 
some identifiable benefit." Juicy Whip Inc. v. Orange Bang Inc. , 5 1 USPQ2d 1 700 (Fed. Cir. 1 999) 
(citing Brennerv. Manson, 383 U.S. 519, 534(1966)). Additionally, the Federal Circuit has stated 
that "(t)o violate § 101 the claimed device must be totally incapable of achieving a useful result." 
Brooktree Corp. v. Advanced Micro Devices, Inc., 977 F.2d 1555, 1571 (Fed. Cir. 1992), 
emphasis added. Cross v. Iizuka (224 USPQ 739 (Fed. Cir. 1985); "Cross") states "any utility of 
the claimed compounds is sufficient to satisfy 35 U.S.C. § 101". Cross at 748, emphasis added. 



Indeed, the Federal Circuit recently emphatically confirmed that "anything under the sun that is made 
by man" is patentable (State Street Bank & Trust Co. v. Signature Financial Group Inc., 47 
USPQ2d 1596, 1600 (Fed. Cir. 1998), citing the U.S. Supreme Court's decision in Diamond vs. 
Chakrabarty, 206 USPQ 193 (S.Ct. 1980)). 

The legal test for utility simply involves an assessment of whether those skilled in the art would 
find any of the utilities described for the invention to be credible or believable. According to the 
Examination Guidelines for the Utility Requirement, if the applicant has asserted that the claimed 
invention is useful for any particular purpose (i.e., it has a "specific and substantial utility") and the 
assertion would be considered credible by a person of ordinary skill in the art, the Examiner should not 
impose a rejection based on lack of utility (66 Federal Register 1098, January 5, 2001). 

As evidence of the credibility of Applicants assertion that the present invention is a variant of 
human semaphorin sem 2, In Applicants' response to the First Office Action (Paper No. 13) 
Applicants' submitted an amino acid sequence comparison between SEQ ID NO: 3 andBAA98132 
(as Exhibit E), which was annotated by third party scientists, wholly unaffiliated with Applicants, as 
encoding semaphorin sem2 [Homo sapiens] (BAA98132: as Exhibit F). In this submission was 
included evidence that SEQ ID NO: 1 (see previously submitted Exhibit G comparing SEQ ID NOS: 
3 and 1 ) identifies a longer isoform of the present invention, which is clearly encoded by the same 
genetic locus. Clearly those of skill in the art would recognize the sequences of the present invention 
as encoding a human semaphorin. As evidenced by the review article entitled "Molecular Mechanisms 
of Axonal Guidance" from the prestigious journal Science (298:1959-1964, 2002 and erratum; 
previously submitted as Exhibit H in Paper No. 1 3), semaphores are well known to those of skill in the 
art as soluble and membrane-bound proteins that act as chemorepulsive factors in neuronal 
development, thereby playing a crucial role in axon guidance. Semaphoring such as the one described 
in the present invention, provide guidance for neuronal growth. In the second paragraph of Section 5. 1 
or the specification as filed, it is stated that "Because of theirrole in neural development, semaphores 
have been subject to considerable scientific scrutiny. For example, U.S. Patents Nos. 5,981,222 and 
5,935,865, both of which are herein incorporated by reference, describe other semaphores as well as 
applications, utilities". Therefore, clearly, there can be no question that Applicants' asserted identity 
and utility for the described sequences a semaphorin is " credible ." In addition, those of skill in the art 
in the biomedical and pharmaceutical industry would readily recognize the utility for semaphores and 
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their application to medical conditions requiring nerve regeneration. For example, the regeneration and 
repair of nerve tissue following the surgical attachment of severed limbs or the resection of diseased 
tissue, as well as nerve repair following a stroke. 

Further support of Applicants' position that the function of the protein encoded by the 
sequences of the present invention is that of semaphorin sem 2 is further provided by the nucleotide 
sequence encoding the previously presented protein (BAA98 1 32) shares 99.957 % percent homology 
over the entire nucleic acid sequence of SEQ ID NO:3 (nucleic acid alignment presented as New 
Exhibit AA; GenBank- accession number AB029496). 

Applicants have thus supplied evidence supporting their assertion that those of skill in the art 
would recognize that the sequences of the present invention encode variants of human semaphorin. 
Applicant's assertion also supports a "well-established" utility in that persons of ordinary skill in the art 
would immediately appreciate. In contrast, the Examiner has provided no evidence of record indicating 
that those of skill in the art would not recognize the sequences of the present invention encode 
semaphorin. As such, the scientific evidence clearly establishes that Applicants have described an 
invention whose utility is in full compliance with the provisions of 35 U.S.C. § 101 , and therefore 
Applicants respectfully request withdrawal of the rejection. 

The Final Action states that there is no disclosure in the specification suggesting that the 
sequences of the present invention as encoding the biological activity of human semaphorins (page 3 
lines 9-10). However, the application clearly identifies similarities between the sequences of the present 
invention (SEQ ID NOS: 1-5) and semaphorin proteins (at least on page 2, lines 14-15; page 4, lines 
10-1 1 and page 17, line 10) and their tissue expression distribution (page 4, lines 10-15) and describes 
the activity of semaphorins (page 4, lines 10-15) and well-established utility "Because, of theirrole in 
neural development, semaphorins have been subject to considerable scientific scrutiny. For example, 
U.S. Patents Nos. 5,981,222 and 5,935,865, both of which are herein incorporated by reference, 
describe other semaphorins as well as applications, utilities, and uses ..." (page 17, lines 14-18). 
Clearly Applicants were aware at the time of filing of the semaphorin like nature of the protein encoded 
by sequences of the present invention. 

Furthermore the Examiner' s position that mere homology of SEQ ID No: 1 to a known DNA 
molecule with a known function does not endow SEQ ID NO. l with the function is contrary to 
Example 10 of the PTO's Revised Interim Utility Guidelines Training Materials (pages 53-55), which 



establishes that a rejection under 35 U.S.C. § 101 as allegedly lacking a patentable utility and under 
35 U.S.C. § 1 12, first paragraph as allegedly unusable by the skilled artisan due to the alleged lack of 
patentable utility, is not proper when there is no reason to doubt the asserted utility of a full length 
sequence (such as the presently claimed sequence) that has a similarity score of 95% to a protein 
having a known function. In the Analysis portion of Example 10 it states that "Based on applicant's 
disclosure and the results of the PTO search, there is no reason to doubt the assertion that SEQ ID 
NO:2 encodes a DNA ligase. Further DNA ligases have a well-established use in the molecular 
biology art based on this class of proteins ability to ligate DNA Note that if there is a well- 
established utility already associated with the claimed invention, the utility need not be asserted in the 

specification as filed Thus the conclusion reached from this analysis is thata35U.S.C. § 101 and 

a 35 U.S.C. § 112 first paragraph, utility rejection should not be made." 

The present case is similar to that presented in Example 10 of the Revised Interim Utility 
Guidelines Training Materials (pages 53-55). In the present case it is clear that the sequences of the 
present invention encode a semaphorin. Semaphorins have a well-established function. Thus a 35 
U.S.C. § 101 and a 35 U.S.C. § 1 12 first paragraph, utility rejection should not have been made and 
should therefore, be withdrawn. 

As set forth in In re Longer (183 USPQ 288 (CCPA 1974); "Langer"): 

As a matter of Patent Office practice, a specification which contains a disclosure of 
utility which corresponds in scope to the subject matter sought to be patented must be 
taken as sufficient to satisfy the utility requirement of § 101 for the entire claimed 
subject matter unless there is a reason for one skilled in the art to question the objective 
truth of the statement of utility or its scope. 

Langer at 297, emphasis in original. As set forth in the MPEP, "Office personnel must provide 
evidence sufficient to show that the statement of asserted utility would be considered 'false' by a person 
of ordinary skill in the art" (MPEP, Eighth Edition at 2 100-40, emphasis added). Thus, absent such 
evidence from the Examiner concerning the role of the presently claimed sequence encodes a protein 
kinase, the present claims clearly meet the requirements of 35 U.S.C. § 101. 

The Action also disregards Applicant's asserted utility of the presently claimed polynucleotides 
on DNA chips (Action at page 3, Section 4.(ii)). Further, the Action seems to be requiring Applicants 
to identify the biological role of the nucleic acid or function of the protein encoded by the presently 
claimed polynucleotides before the present sequences can be used in gene chip applications that meet 
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the requirements of § 10 1 . Applicants respectfully point out that knowledge of the exact function or role 
of the presently claimed sequence is not required to track expression patterns using aDNA chip. Given 
the widespread utility of such "gene chip" methods usingpublic domain gene sequence information, 
there can be little doubt that the use of the presently described novel sequences would have great utility 
in such DNA chip applications. Particularly as Applicants have identified the protein encoded as a 
semaphorin, identified the specific tissues in which this gene is expressed (page 4, lines 1 0- 1 5) and 
identified a specific polymorphism in SEQ ID NO: 1 (page 17, lines 8-18). The claimed sequence 
provides a specific marker of the human genome (see evidence below), and that such specific markers 
are targets for discovering drugs that are associated with human disease. Thus, those skilled in the art 
would instantly recognize that the present nucleotide sequence would be an ideal, novel candidate for 
assessing gene expression using, for example, DNA chips, as the specification details. Such "DNA 
chips" clearly have utility, as evidenced by hundreds of issued U.S. Patents, as exemplified by U.S. 
Patent Nos. 5,445,934, 5,556,752, 5,744,305, as well as more recently issued U.S. Patent Nos. 
5,837,832, 6, 1 56,50 1 and 6,261 ,776. Accordingly, the present sequence has a specific utility in such 
DNA chip applications. Clearly, compositions that enhance the utility of such DNA chips, such as the 
presently claimed nucleotide sequence, must also be useful. 

Additionally, since only a small percentage of the genome (2-4%) actually encodes exons, 
which in-turn encode amino acid sequences. Thus, not all human genomic DNA sequences are useful 
in such gene chip applications. Thus, the present claims clearly meet the requirements of 35 U.S.C. § 
101. It has been clearly established that a statement of utility in a specification must be accepted absent 
reasons why one skilled in the art would have reason to doubt the objective truth of such statement. 
In re hanger, 503 F.2d 1380, 1391, 183 USPQ 288, 297 (CCPA, 1974); In re Marzocchi, 439 
F.2d 220, 224, 169 USPQ 367, 370 (CCPA, 1971). 

Evidence of the "real world" substantial utility of the present invention is further provided by the 
fact that there is an entire industry established based on the use of gene sequences or fragments thereof 
in a gene chip format. Perhaps the most notable gene chip company is Affymetrix. However, there are 
many companies which have, at one time or another, concentrated on the use of gene sequences or 
fragments, in gene chip and non-gene chip formats, for example: Gene Logic, ABI-Perkin-Elmer, 
HySeq and Incyte. In addition, one such company, Rosettalnpharmatics, was viewed to have such 
"real world" value that it was acquired by large pharmaceutical company, Merck & Co., for substantial 



sums of money (net equity value of the transaction was $620 million). The "real world" substantial 
industrial utility of gene sequences or fragments would, therefore, appear to be widespread and well 
established. Clearly, persons of skill in the art, as well as venture capitalists and investors, readily 
recognize the utility, both scientific and commercial, of genomic data in general, and specifically human 
genomic data. Billions of dollars have been invested in the human genome project, resulting in useful 
genomic data (see, e.g., Venter etal, 2001, Science 291: 1304). The results have been a stunning 
success as the utility of human genomic data has been widely recognized as a great gift to humanity (see, 
e.g.,JasnyandKennedy,2001,Science297:1153). Clearly, the usefulness of human genomic data, 
such as the presently claimed nucleic acid molecules, is substantial and credible (worthy of billions of 
dollars and the creation of numerous companies focused on such information) and well-established Ythe 
utility of human genomic information has been clearly understood for many years). The use of the 
claimed polypeptide in an array for screening purposes Applicants respectfully point out that nucleic 
acid sequences have the greatest specific utility in gene chip applications once the role of the sequence 
has been identified, as have tissues of interest, as in the present case. Once the role of the particular 
nucleic acidis known, the level of gene expression has andeven greater significance. By identifying the 
physiological activity role of theclaimed sequence, theclaimed sequence has a far greater utility in gene 
chip applications that just any random piece of DNA. 

As a still further example of utility is the use of the present sequences in such diagnostic assays 
(at least at page 9, line 7; page 18 line 1 1 ; page 25, line 32) as those associated with identification of 
paternity and forensic analysis, among others. The sequences of the present invention have particular 
utility as the application as filed identified a polymorphism in SEQ ID NO: 1 (page 17, lines 8-18). This 
is also not a case of a potential utility. Appellants respectfully submit that even in the worst case 
scenario, the described polymorphisms are each useful to distinguish 50% of the population (in other 
words, the marker being present in half of the population) and that the ability of a polymorphic marker 
to distinguish atleast 50% of the population is an inherent feature of any pnlymntph^ marker, and this 
featureis well understood by those of skill in the art. Appellants note that asamatteroflaw,itis well 
settled that a patent need not disclose what is well known in the art. In re Wands, 8 USPQ 2d 1400 
(FedCir. 1988). Appellants support for Appellants' assertion of utility is provided by the fact that the 
skilled artisan would readily recognize and easily believe that the presently described polymorphic 
markets couldbeusemlinforensk^ 



such as those described by Appellants everyday provides more that ample support for the assertion 
that forensic biologists would also be able to use the specific polymorphic markers described by 
Appellants in the same fashion. Therefore, again it is clear that the sequences of the present invention 
have utility. 

Applicants respectfully submit that specific utility, which is the proper standard for utility under 
35 U.S .C. § 101 , is distinct from the requirement for a unique utility, which is clearly an improper 
standard. As clearly stated by the Federal Circuit in Carl Zeiss Stiftung v. Renishaw PLC, 20 
USPQ2d 1101 (Fed. Cir. 1991; "Carl Zeiss"): 

An invention need not be the best or only way to accomplish a certain result, and it 
need only be useful to some extent and in certain applications: "[T]he fact that an 
invention has only limited utility and is only operable in certain applications is not 
grounds for finding a lack of utility." Envirotech Corp. v. Al George Inc 221 USPO 
473, 480 (Fed. Cir. 1984) ' " 

Therefore Just because other nucleic acid sequences find utility in gene chip applications does not mean 
that the use of Applicants* sequence in gene chip applications is not a specific utility. Furthermore, the 
requirement for a unique utility is clearly not the standard adopted by the Patent and Trademark Office. 
If everyinvention were required to have a unique utility, the Patent and Trademark Office wouldno 
longer be issuing patents on batteries, automobile tires, golf balls, golfclubs, and treatments for a variety 
of human diseases, such as cancer and bacterial or viral infections, just to name a few particular 
examples, because examples of each of these have already been described and patented. All batteries 
have theexact same utility -specifically, to provide power. All automobile tires have the exact same 
utility-specifically.foruseon automobiles. All golf balls andgolf clubs have the exact same utility - 
specifically, use in the game of golf. All cancer treatments have the exact same utility - specifically, to 
treat cancer. All anti-infectious agents have the exact same broader utility - specifically, to treat 
infections. However, only the briefest perusal of virtually any issue of the Official Gazette provides 
numerous examples of patents being granted on each of the above compositions every week . 
Furthermore, if a composition needed to be unique to be patented, the entire class and subclass system 
would be an effort in futility, as the class and subclass system serves solely to group such common 
inventions, which would not be required if each invention needed to have a unigue utility. Thus, the 
present sequence clearly meets the requirements of 35 U.S.C. § 101. 
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The Action further discounts the utility ofthe claimed sequence for exon mapping as there is 
"no knowledge of what the function ofthe actual sequence, regardless ofthe basis of homology, then 
there is no asserted utility"(Final Action Page 4, line 4-6). Applicants respectfully submit that the 
function ofthe sequence as a semaphorin and several utilities were asserted in the application as filed 
(addressed above). 

AMoughAppHcantsneed^ 
of 35 U.S.C. § 101 {Raytheon v. Roper, 220 USPQ 592 (Fed. Cir. 1983); In re Gottlieb, 140 
USPQ 665 (CCPA 1964); In re Malachowski, 189 USPQ 432 (CCPA 1976); Hoffman v. Klaus, 
9 USPQ2d 1657 (Bd. Pat. App. & Inter. 1988)), as a further example ofthe utility of the presently 
claimed polynucleotides, as described in the specification at least at page 12 lines 4-10, the present 
nucleotide sequence has a a>ec^ utility in dete™^ 

human chromosome, for example mapping the protein encoding regions. As evidence supporting 
Applicants assertions of the specific utility ofthe sequences ofthe present invention in localizing the 
stifle region of me human ch^^ 

junctionsis the information provided as Exhibit BB. This is the result of overlaying the sequence of 
SEQIDNOrlofthepresent invention and the identified human genomic sequence. By doing this, one 
isable toidentifytheportionsofthe genome that encode the present invention. As these regions ofthe 
genome are non-contiguous, this is indicative of individual exons. The results of such an analysis 

indicate that the sequence ofthe present invention is the result ofa 16 exon gene contained within the 
BACcloneAC006208.3.Clear^^ 

exons on chromosome 3, one would not have been able to deduce the sequence that encodes the 

moleculesofthepresentinventionwithoutknowingthe sequence. Clearly, the present polynucleotide 
providesexquisite specificit y^ 

gene encoding the given polynucleotide, a utility not shared by virtually any other nucleic acid 
sequences. In fact, it is this specificity that makes this particular sequence so useful. 

Earlygenemappingtechniques relied on methods such as Giemsa staining to identify regions 
of chromosomes. However, such techniques produced genetic maps with a resolution of only 5 to 10 
megabases,fartoolowtobeofmuchh^ 

artisan readily appreciates the significant benefit afforded by markers that map a specific locus ofthe 
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human genome, such as the present nucleic acid sequence. Thus, the present claims clearly meet the 
requirements of 35 U.S.C. § 101. 

Equally significant is that the claimed polynucleotide sequences define how the encoded exons 
are actually spliced together to produce an active transcript (i.e. , the described sequences are useful 
for functionally defining exon splice-junctions). The presently claimed sequence clearly identified the 
intron/exon boundaries, as described above. The specification details that "sequences derived from 
regions adjacent to the intron/exon boundaries of the human gene can be used to design primers for use 
in amplification assays to detect mutations within the exons, introns, splice sites (e.g. , splice acceptor 
and/or donor sites), etc. , that can be used in diagnostics and pharmacogenomics" (specification at page 
8, lines 14-20). Thus, the present claims clearly meet the requirements of 35 U.S.C. § 101. 

Applicants again draw attention to the distinction between the requirements of a specific utility 
with a unique utility. The fact that a small number of other nucleotide sequences could be used to map 
the protein coding regions in this specific region of chromosome 3 does not mean that the use of 
Applicants' sequence to map the protein coding regions of chromosome 3 is not a specific utility (Carl 
Zeiss Stiftung v. Renishaw PLC, supra). 

Finally, while Applicants are well aware of the new Utility Guidelines set forth by the USPTO, 
ithas been long estabhshed mat the current mles regarding the examination of patent applications is and 
always has been the patent laws as set forth in 35 U.S.C. and the patent rules as set forth in 37 C.F.R., 
not the Manual of Patent Examination Procedure or particular guidelines for patent examination set forth 
by the USPTO. Furthermore, it is the job of the judiciary, not the USPTO, to interpret these laws and 
rules. Applicants point out that guidelines that are not consistent with the patent laws, or the 
interpretation of these laws by thejudicial branch, are not the final word in determining whether or not 
claimscomplywithanyparticularsectionoffr^ 

recent changes in either 35 U.S.C. § 101, or in the interpretation of 35U.S.C. § 101 bytheSupreme 
Court or the Federal Circuit that is in keeping with the new Utility Guidelines set forth by the USPTO. 
This is underscored by numerous patents that have been issued over the years that claim nucleic acid 
fragments that do not comply with the new Utility Guidelines. As examples of such issued U.S. Patents, 
the Examiner is invited to review U.S. Patent Nos. 5,817,479, 5,654,173, and 5,552,281 (each of 
which claim short polynucleotides), none of which contain examples of the "real-world" utilities that 



12 



seem to be required in the Action. As issued U.S . Patents are presumed to meet all of the requirements 
for patentability, including 35 U.S.C. §§ 101 and 112, first paragraph (see Section m, below), 
Applicants submit that the presently claimed polynucleotides must also meet the requirements of 
35 U.S.C. § 10 1 . While Applicants understand that each patent application is examined on the basis 
of its individual merits, Applicants are unaware of any changes to 35 U.S.C. § 101, or in the 
interpretation of 35 U.S .C. § 10 1 by the Supreme Court or the Federal Circuit, since the issuance of 
these patents that render the subject matter claimed in these patents, which is similar to the subject 
matter in question in the present application, as suddenly non-statutory or failing to meet the 
requirements of 35 U.S.C. § 101. The requirement of Applicants to meet a different standard of utility 
in the present case would be arbitrary and capricious, and cannot stand. 

In summary, the present situation is similar to Example 10 of the Revised Interim Utility 
Guidelines Training Materials (pages 53-55), which establishes that a rejection under 35 U.S.C. § 101 
as allegedly lacking a patentable utility and under 35 U.S.C. § 1 12, first paragraph as allegedly unusable 
by the skilled artisan due to the alleged lack of patentable utility, is not proper when the full length 
sequence of the invention encodes a protein that has a well known function. Furthermore this response 
has described a series of additional substantial, specific, credible and well-established utilities for the 
present invention in addition to those described in Applicants' many previous responses. Therefore, 
Applicants submit that as the presently claimed sequence molecules have been shown to have a 
substantial, specific, credible and well-established utility, the rejection of the claims under 35 U.S.C. 
§ 101 has been overcome. Thus, Applicants respectfully request that the rejection be withdrawn. 

V. Rejection of Claims Under 35 U.S.C. § 112, First Paragraph 

Clai ms 1-4, 11-12 are al so rejected under 35 U.S.C. § 1 12 first paragraph. Specifically, since 
the claimed invention is not supported by either specific and substantial asserted utility or a well 
established utility for the reasons set forth above, one skilled in the art clearly would not know how to 
use the claimed invention. 

Applicants submit that all claims have been shown to have "a specific, substantial, and credible 
utility", as detailed above. Applicants therefore request that the rejection of all claims under 35 U.S .C. 
§ 1 12, first paragraph, be withdrawn. 
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VI. Rejection of Claims Under 35 U.S.C. § 101 & 35 U.S.C. § 112 

Claims 11-12 are rejected under the above 35 U.S.C. § 101 and 35 U.S.C. £ 112, first 
paragraph, allegedly based on the same reasoning as above. As claims 1 1 and 12 are dependent upon 
Claim 4 and Claim 4 has now been shown to have a patentable utility under 35 U.S.C. Sections 101 
and 112, this rejection has been avoided and Applicants, therefore, request withdrawal of the 
rejection. 

VII. Conclusion 

The present document is a full and complete response to the Action. In conclusion, Applicants 
submit that, in light of the foregoing remarks, the present case is in condition for allowance, and such 
favorable action is respectfully requested. Should Examiner Chism have any questions or comments, 
or believe that certain amendments of the claims might serve to improve their clarity, a telephone call 
to the undersigned Applicants' representative is earnestly solicited. 



Respectfully submitted, 



September 22. 2003 
Date 




LEXICON GENETICS INCORPORATED 
(281) 863-3333 



Customer # 24231 
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SEP 2 6 2003 

^ . ,«sr • 

^^TaAnS&^P^ Genomic Sequences p a „_ i _ f <r 

Exhibit AA PilL 

FASTA searches a protein or DNA sequence data bank ^ 
version 3.3t05 March 3 0, 2000 »tP 3* 

Please cite: 

W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 fcfcfc,* 

***** 

/tmp/fastaCAAWTaiDv: 2349 nt 
>LEX151 SEQ ID NO: 3 
vs /tmp/fastaDAAXTaiDv library 
searching /tmp/fastaDAAXTaiDv library 

4700 residues in 1 sequences 

FASTA (3.34 January 2000) function [optimized, +5/-4 matrix (5:-4)] ktup- 6 
join: 74, opt: 59, gap-pen: -16/ -4, width: 16 
Scan time: 0.100 
The best scores are: opt 
gi | 8978201 |dbj |AB029496.1| Homo sapiens mRNA f (4700) [f] 11742 
gi | 8978201 | dbj (AB029496 . 1 j Homo sapiens mRNA f (4700) [r] 78 

»gi|8978201|dbj |AB029496.1| Homo sapiens mRNA for semap (4700 nt) 
initn: 11742 initl: 11742 opt: 11742 
99.957% identity in 2349 nt overlap (1-2349:1-2349) 

10 20 30 40 50 60 

LEX151 ATGGCCCCCTCGGCCTGGGCCATTTGCTGGCTGCTAGGGGGCCTCCTGCTCCATGGGGGT 

gi | 897 ATGGCCCCCTCGGCCTGGGCCATTTGCTGGCTGCTAGGGGGCCTC^ 

10 20 30 40 50 60 

70 80 90 100 110 120 

LEX151 AGCTCTGGCCCCAGCCCCGGCCCCAGTGTGCCCCGCCTGCGGCTCTCCTACCGAGACCTC 

gi | 897 AGCTCTGGCCCCAGCCCCGGCCCCAGTGTGCCCCGCOT 

70 80 90 100 110 120 

130 140 150 160 170 180 

LEX151 CTGTCTGCCAACCGCTCTGCCATCTTTCTGGGCCCCCAGGGCTCCCTGAACCTCCAGGCC 

gi | 897 CTGTCTGCCAACCGCTCTGCCATCTTTCTGGGCCCCCAGGGOT 

130 * 140 150 160 170 180 

190 200 210 220 230 240 

JLEX151~ATGTAGCTAGATGAGTACPfiAaAPrf2P^ 



gi | 897 ATGTACCTAGATGAGTACCGAGACCGCCTCTTTCTGGGTG^ 

190 200 210 220 230 240 

250 260 270 280 290 300 

LEX151 CTGCGGCTGGACCAGGCATGGCCAGATCCCCGGGAGGTCCTGTGGCCACCGCAGCCAGGA 

gi | 8 9 7 CTGCGGCTGGACCAGGCATGGCCAGATCCCCGGGAGGTCCTGTGGCCACCGCAGCCAGGA 
250 260 270 280 290 300 

310 3 20 330 340 350 360 

LEX1 5 1 CAGAGGGAGGAGTGTGTTCGAAAGGGAAGAGATCCTTTGACAGAGTGCGCCAACTTCGTG 

gi | 897 CAGAGGGAGGAGTGTGTTCGAAAGGGAAGAGATCCTTTGACAGAGTGCGCCAACTTCCT 
310 3 20 330 340 350 360 



37 ° 380 390 400 410 



420 
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gi | 897 CGGGTGCTACAGCCTCACAACCGGACCCACCTGCTAGCCTGTGGCACTGGGGCCTTCCAG 
370 380 390 400 410 420 

430 440 450 460 470 480 

LEX151 CCCACCTGTGCCCTCATCACAGTTGGCCACCGTGGGGAGCATGTGCTCCACCTGGAGCCT 



gi | 897 CCCACCTGTGCCCTCATCACAGTTGGCCACCGTGGGGAGCATGTGCTCCACCTGGAGCCT 
430 440 450 460 470 480 

490 500 510 520 530 540 

LEX151 GGCAGTGTGGAAAGTGGCCGGGGGCGGTGCCCTCACGAGCCCAGCCGTCCCTTTGCCAGC 



gi | 897 GGCAGTGTGGAAAGTGGCCGGGGGCGGTGCCCTCACGAGCCCAGCCGTCCCTTTGCCAGC 
490 500 510 520 530 540 

550 560 570 580 590 600 

LEX1 5 1 ACCTTCATAGACGGGGAGCTGTACACGGGTCTCACTGCTGACTTCCTGGGGCGAGAGGCC 



gi | 897 ACCTTCATAGACGGGGAGCTGTACACGGGTCTCACTGCTGACTTCCTGGGGCGAGAGGCC 
550 560 570 580 590 600 

610 620 630 640 650 660 

LEX151 ATGATCTTCCGAAGTGGAGGTCCTCGGCCAGCTCTGCGTTCCGACTCTGACCAGAGTCTC 



gi | 897 ATGATCTTCCGAAGTGGAGGTCCTCGGCCAGCTCTGCGTTCCGACTCTGACCAGAGTCTC 
610 620 630 640 650 660 

670 680 690 700 710 720 

LEX1 5 1 TTGCACGACCCCCGGTTTGTGATGGCCGCCCGGATCCCTGAGAACTCTGACCAGGACAAT 



gi | 897 TTGCACGACCCCCGGTTTGTGATGGCCGCCCGGATCCCTGAGAACTCTGACCAGGACAAT 
670 680 690 700 710 720 

730 740 750 760 770 780 

LEX151 GACAAGGTGTACTTCTTCTTCTCGGAGACGGTCCCCTCGCCCGATGGTGGCTCGAACCAT 



gi | 897 GACAAGGTGTACTTCTTCTTCTCGGAGACGGTCCCCTCGCCCGATGGTGGCTCGAACCAT 
730 * 740 750 760 770 780 

790 800 810 820 830 840 

IjEX-l-5-l--GTeAeTGTCAGCCGCGTGGGCCGCGTCTGCGTGAATGATGCTGGGGGCCAGCGGGTGCTG" 



gi | 897 GTC AC TGTC AGC CGC GTGGGC CGCGTC TGCGTGAATGATGCTGGGGGC C AGCGGGTGC TG 
790 800 810 820 830 840 

850 860 870 880 890 900 

LEX151 GTGAACAAATGGAGGACTTTCCTCAAGGCCAGGCTGGTCTGCTCGGTGCCCGGCCCTGGT 



gi | 897 GTGAACAAATGGAGCACTTTCCTCAAGGCCAGGCTGGTCTGCTCGGTGCCCGGCCCTGGT 
850 860 870 880 890 900 

910 920 930 940 950 960 

LEX1 5 1 GGTGCCGAGACCCACTTTGACCAGCTAGAGGATGTGTTCCTGCTGTGGCCCAAGGCCGGG 

gi | 897 GGTGCCGAGACCCACTTTGACCAGCTAGAGGATGTGTTCCTGCTGTGGCCCAAGGCCGGG 
910 920 930 940 950 960 

97 0 980 990 1000 1010 102 0 
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gi | 897 AAGAGCCTCGAGGTGTACGCGCTGTTCAGCACCGTCAGTGCCGTGTTCCAGGGCTTCGCC 
970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

LEXl 5 1 GTCTGTGTGTACCACATGGCAGACATCTGGGAGGTTTTCAACGGGCCCTTTGCCCACCGA 



gi | 897 GTCTGTGTGTACCACATGGCAGACATCTGGGAGGTTTTCAACGGGCCCTTTGCCCACCGA 
1030 1040 1050 1060 1070 1080 

1090 1100 1110 1120 1130 1140 

LEXl 5 1 GATGGGCCTCAGC ACCAGTGGGGGCCCTATGGGGGCAAGGTGCCCTTCCCTCGCCCTGGC 



gi | 897 GATGGGCCTCAGCACCAGTGGGGGCCCTATGGGGGCAAGGTGCCCTTCCCTCGCCCTGGC 
1090 1100 1110 1120 1130 1140 

1150 1160 1170 1180 1190 1200 

LEXl 5 1 GTGTGCCCCAGCAAGATGACCGCACAGCCAGGACGGCCTTTTGGCAGCACCAAGGACTAC 



gi | 897 GTGTGC CC C AGC AAGATG AC C GC AC AGC C AGGAC GGCCTTTTGGC AGC ACC AAGG AC T AC 
1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

LEXl 51 CCAGATGAGGTGCTGCAGTTTGCCCGAGCCCACCCCCTCATGTTCTGGCCTGTGCGGCCT 



gi | 897 CCAGATGAGGTGCTGCAGTTTGCCCGAGCCCACCCCCTCATGTTCTGGCCTGTGCGGCCT 
1210 . 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

LEXl 5 1 CGACATGGCCGCCCTGTCCTTGTCAAGACCCACCTGGCCCAGCAGCTACACCAGATCGTG 



gi | 897 CGACATGGCCGCCCTGTCCTTGTCAAGACCCACCTGGCCCAGCAGCTACACCAGATCGTG 
1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 

LEX151 GTGGACCGCGTGGAGGCAGAGGATGGGACCTACGATGTCATTTTCCTGGGGACTGACTCA 

gi | 897 GTGGACCGCGTGGAGGCAGAGGATGGGACCTACGATGTCATTTTCCTGGGGACTGACTCA 
1330* 1340 1350 1360 1370 1380 

1390 1400 1410 1420 1430 1440 

~LEX1"51"~GGGTCTGTGCTCAAAGTCATCGCTCTCCAGGCAGGGGGCTCAGCTGAACCTGAGGAAGTG~ 

gi | 897 GGGTCTGTGC TC AAAGTC ATC GCTC TCC AGGC AGGGGGC TC AGC TGAAC C TG AGG AAGTG 
1390 1400 1410 1420 1430 1440 

1450 1460 1470 1480 1490 1500 

LEXl 5 1 GTTCTGGAGG AGC TC C AGGTGTTTAAGGTGCC AAC AC C T ATC AC C G AAATGG AG ATC TCT 

gi | 897 GTTCTGGAGGAGCTCCAGGTGTTTAAGGTGCCAACACCTATCACCGAAATGGAGATCTCT 
1450 1460 1470 1480 1490 1500 

1510 1520 1530 1540 1550 1560 

LEXl 5 1 GTCAAAAGGCAAATGCTATACGTGGGCTCTCGGCTGGGTGTGGCCCAGCTGCGGCTGCAC 

gi | 897 GTCAAAAGGCAAATGCTATACGTGGGCTCTCGGCTGGGTGTGGCCCAGCTGCGGCTGCAC 
1510 1520 1530 1540 1550 1560 

1570 1580 1590 1600 1610 1620 
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LEXl 5 1 CAATGTGAGACTTACGGCACTGCCTGTGCAGAGTGCTGCCTGGCCCGGGACCCATACTGT 

gi | 897 CAATGTGAGACTTACGGCACTGCCTGTGCAGAGTGCTGCCTGGCCCGGGACCCATACTGT 
1570 1580 1590 1600 1610 1620 

1630 1640 1650 1660 1670 1680 

LEXl 51 GCCTGGGATGGTGCCTCCTGTACCCACTACCGCCCCAGCCTTGGCAAGCGCCGGTTCCGC 

gi | 897 GCCTGGGATGGTGCCTCCTGTACCCACTACCGCCCCAGCCTTGGCAAGCGCCGGTTCCGC 
1630 1640 1650 1660 1670 1680 

1690 1700 1710 1720 1730. 1740 

LEXl 5 1 CGGCAGGACATCCGGCACGGCAACCCTGCCCTGCAGTGCCTGGGCCAGAGCCAGGAAGAA 
*•••••**•****••********«••••••• ••••••••#...;. 

gi | 897 CGGCAGGACATCCGGCACGGCAACCCTGCCCTGCAGTGCCTGGGCCAGAGCCAGGAAGAA 

1690 1700 1710 1720 1730 1740 

1750 1760 1770 1780 1790 1800 

LEXl 5 1 GAGGC AGTGGGAC TTGTGGC AGC C AC C ATGGTC T AC GGC ACGG AGC AC AATAGC ACCTTC 
— 

gi | 897 GAGGCAGTGGGACTTGTGGCAGCCACC ATGGTC TACGGC AC GGAGC AC AATAGC ACCTTC 
1750 1760 1770 1780 1790 1800 

1810 1820 1830 1840 1850 1860 

LEXl 5 1 CTGGAGTGCCTGCCCAAGTCTCCCCARGCTGCTGTGCGCTGGCTCTTGCAGAGGCCAGGG 

gi | 897 CTGGAGTGCCTGCCCAAGTCTCCCCAGGCTGCTGTGCGCTGGCTCTTGCAGAGGCCAGGG 
1810 1820 1830 1840 1850 1860 

1870 1880 1890 1900 1910 1920 

LEX 15 1 GATGAGGGGCCTGACCAGGTGAAGACGGACGAGCGAGTCTTGCACACGGAGCGGGGGCTG 

gi | 897 G ATG AGGGGC C TGAC C AGGTG AAGAC GGAC G AGC G AGTC TTGC AC ACGGAGC GGGGGCTG 
1870 1880 1890 1900 1910 1920 

1930 1940 1950 1960 1970 1980 

LEXl 5 1 CTGTTCCGCAGGCTTAGCCGTTTCGATGCGGGCACCTACACCTGCACCACTCTGGAGCAT 

gi | 897 CTGTTCCGCAGGCTTAGCCGTTTCGATGCGGGCACCTACACCTGCACCACTCTGGAGCAT 
1930 • 1940 1950 1960 1970 1980 

1990 2000 2010 2020 2030 2040 

~LEX 1"5~1~~GGC TTCTC CC AGAC TGTGGTC CGCC TGGC TC TGGTGGTGATTGTGGCCTC ACAGC TGGAC 

gi | 897 GGC TTC TCCC AGAC TGTGGTCC GC C TGGCTCTGGTGGTGATTGTGGCC TCAC AGC TGGAC 
1990 2000 2010 2020 2030 2040 

2050 2060 2070 2080 2090 2100 

LEXl 51 AACCTGTTCCCTCCGGAGCCAAAGCCAGAGGAGCCCCCAGCCCGGGGAGGCCTGGCTTCC 

gi | 897 AACCTGTTCCCTCCGGAGCCAAAGCCAGAGGAGCCCCCAG 

2050 2060 2070 2080 2090 2100 

2110 2120 2130 2140 2150 2160 

LEXl 5 1 ACCCCACCCAAGGCCTGGTACAAGGACATCCTGCAGCTCATTGGCTTCGCCAACCTGCCC 

gi | 897 ACCCCACCCAAGGCCTGGTACAAGGACATCCTGCAGCTCATTGGCTTCGCCAA 

2110 2120 2130 2140 2150 2160 

2170 2180 2190 2200 2210 2220 
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LEXl 5 1 CGGGTGGATGAGTACTGTGAGCGCGTGTGGTGC AGGGGCACC ACGGAATGCTC AGGCTGC 

gi I 897 CGGGTGGATGAGTACTGTGAGCGCGTGTGGTGC AGGGGCACC ACGGAATGCTC AGGCTGC 
2170 2180 2190 2200 2210 2220 

2230 2240 2250 2260 2270 2280 

LEXl 5 1 TTCCGGAGCCGGAGCCGGGGCAAGCAGGCCAGGGGCAAGAGCTGGGCAGGGCTGGAGCTA 



gi | 897 TTCCGGAGC C GG AGC C GGGGC AAGC AGGCC AGGGGC AAGAGC TGGGC AGGGC TGGAGC TA 
2230 2240 2250 2260 2270 2280 

2290 2300 2310 2320 2330 2340 

■ LEX151 GGC AAGAAGATGAAGAGCCGGGTGC ATGCCGAGC AC AATC GGACGC CC C GGGAGGTGGAG 



gi | 897 GGCAAGAAGATGAAGAGCCGGGTGCATGCCGAGCACAATCGGACGCCCC GGGAGGTGGAG 
2290 2300 2310 2320 2330 2340 



LEXl 51 GCCACGTAG 



gi | 897 GC C AC GT AG AAGGGGGC AG AGGAGGGGTGGTC AGGATGGGC TGGGGGGC C C AC T AGC AGC 
2350 2360 2370 2380 2390 2400 

»gi | 8978201 | dbj | AB029496 . 1 | Homo sapiens mRNA for semap (4700 nt) 
rev-comp initn: 136 initl: 78 opt: 78 
85.714% identity in 21 nt overlap (875-855:476-496) 

900 890 880 870 860 850 

LEXl 5 - GCACCACCAGGGCCGGGCACCGAGCAGACCAGCCTGGCCTTGAGGAAAGTGCTCCATTTG 



gi | 897 GCCACCGTGGGGAGCATGTGCTCCACC TGGAGC CTGGCAGTGTGGAAAGTGGCCGGGGGC 
450 460 470 480 490 500 

840 830 820 810 800 790 

. LEX15- TTCACCAGCACCCGCTGGCCCCCAGCATCATTCACGCAGACGCGGCCCACGCGGCTGACA * 

gi | 897 GGTGCCCTCACGAGCCCAGCCGTCCCTTTGCCAGCACCTTCATAGACGGGGAGCTGTACA 
510 520 530 540 550 560 



2349 residues in 1 query sequences 

4700 residues in 1 library sequences 
Scomplib [version 3.3t05 March 30, 2000] 

start: Fri Sep 19 13:51:42 2003 done: Fri Sep 19 13:51:42 2003 
Scan time: 0.100 Display time: 0.150 

Function used was FASTA 
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:gctcagg 

cacagacUHPTacgc 

Nucleotide Protein 



CTTCG-ggf GCT AG^SS ATCGG ATC CCCGGOSE flfcrJATTATATAGC TCGATCGATC1 
I^CG'liP'-OGr-^ 

^tLiQn .^^Tfr TT f G Git TA C vjTGT^t f 'lAll t#l VH^I ; fe- 1 rf^T C 




TCAC1 

Genome 



ITC GCATACGT 
TACTAAC 
. Structure 



KcTTAC TAAC C AA7rft(t aG^Gv? 

PMC 




Taxonomy OMIM 



Go 
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PubMed 
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□ 1: AB029496 . Homo sapiens mRNA...[gi:8978201] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



gene 



CDS 



AB029496 4700 bp mRNA linear PRI 07-JUL-2000 

Homo sapiens mRNA for semaphorin sem2 , complete cds . 

AB029496 

AB029496.1 GI: 8978201 
semaphorin sem2 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 4700) 

Seki,N., Hat tori, A., Hayashi,A. , Kozuma,S., Muramatsu, M. , 
Miyajima,N. and Saito,T. 
Human semaphorin 

Published Only in DataBase (2 000) 

2 (bases 1 to 4700) 

Seki,N., Hattori,A., Hayashi,A., Kozuma,S., Muramatsu, M. , 
Miyajima,N. and Saito,T. 
Direct Submission 

Submitted ( 01-JUL-1999 ) Toshiyuki Saito, National Institute of 
Radiological Sciences, Genome Research Group; Inage-ku Anagawa 
4-9-1, Chiba, Chiba 263-8555, Japan (E-mail : t_saito@nirs . go . jp, 
Tel: 81-43-201-3135, Fax:81-43-251-9818) 

Location/Qualifiers 

1. .4700 

/organism="Homo sapiens" 

/mol_type= " mRNA " 

/db_xref = " t axon : 9 6 0 6 " 

1. .4700 

/gene= " sem2 " 

1. .2349 

/gene= " sem2 " 

/codon_start=l 

/ p r oduc t = " s emaphor in s em2 11 

/protein_id= " BAA98132 . 1 " 

/db_xref ="GI : 8978202 » 

/ trans la tion= M MAPS AWAICWLLGGLLLHGGSSGPSPGPSVPRLRLSYRDLLS AN 
RSAIFLGPQGSLNLQAMYLDEYRDRLFLGGLDALYSLRLDQAWPDPREVLWPPQPGQR 
EECVRKGRDPLTECANFVRVLQPHNRTHLLACGTGAFQPTCALITVGHRGEHVLHLEP 
GSVESGRGRCPHEPSRPFASTFIDGELYTGLTADFLGREAMIFRSGGPRPALRSDSDQ 
SLLHDPRFVMAARIPENSDQDNDKVYFFFSETVPSPDGGSNHVTVSRVGRVCVNDAGG 
QRVLVNKWSTFLKARLVCSVPGPGGAETHFDQLEDVFLLWPKAGKSLEVYALFSTVSA 
VFQGFAVCVYHMADIWEVFNGPFAHRDGPQHQWGPYGGKVPFPRPGVCPSKMTAQPGR 
PFGSTKDYPDEVLQFARAHPLMFWPVRPRHGRPVLVKTHLAQQLHQIWDRVEAEDGT 
YDVIFLGTDSGSVLKVIALQAGGSAEPEEWLEELQVFKVPTPITEMEISVKRQMLYV 
GSRLGVAQLRLHQCETYGTACAECCLARDPYCAWDGASCTHYRPSLGKRRFRRQDIRH 
GNPALQCLGQSQEEEAVGLVAATMVYGTEHNSTFLECLPKSPQAAVRWLLQRPGDEGP 
DQVKTDERVLHTERGLLFRRLSRFDAGTYTCTTLEHGFSQTWRLALWIVASQLDNL 
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FPPEPKPEEPPARGGLASTPPKAWYKDILQLIGFANLPRVDEYCERVWCRGTTECSGC 
FRSRSRGKQARGKSWAGLELGKKMKSRVHAEHNRTPREVEAT " 

BASE COUNT 972 a 1307 c 1467 g 954 t 

ORIGIN 

1 atggccccct cggcctgggc catttgctgg ctgctagggg gcctcctgct ccatgggggt 
61 agctctggcc ccagccccgg ccccagtgtg ccccgcctgc ggctctccta ccgagacctc 
121 ctgtctgcca accgctctgc catctttctg ggcccccagg gctccctgaa cctccaggcc 
181 atgtacctag atgagtaccg agaccgcctc tttctgggtg gcctggacgc cctctactct 
241 ctgcggctgg accaggcatg gccagatccc cgggaggtcc tgtggccacc gcagccagga 
3 01 cagagggagg agtgtgttcg aaagggaaga gatcctttga cagagtgcgc caacttcgtg 
361 cgggtgctac agcctcacaa ccggacccac ctgctagcct gtggcactgg ggccttccag 
421 cccacctgtg ccctcatcac agttggccac cgtggggagc atgtgctcca cctggagcct 
481- ggcagtgtgg aaagtggccg ggggcggtgc cctcacgagc ccagccgtcc ctttgccagc 
541 accttcatag acggggagct gtacacgggt ctcactgctg acttcctggg gcgagaggcc 
601 atgatcttcc gaagtggagg tcctcggcca gctctgcgtt ccgactctga ccagagtctc 
661 ttgcacgacc cccggtttgt gatggccgcc cggatccctg agaactctga ccaggacaat 
721 gacaaggtgt acttcttctt ctcggagacg gtcccctcgc ccgatggtgg ctcgaaccat 
781 gtcactgtca gccgcgtggg ccgcgtctgc gtgaatgatg ctgggggcca gcgggtgctg 
841 gtgaacaaat ggagcacttt cctcaaggcc aggctggtct gctcggtgcc cggccctggt 
901 ggtgccgaga cccactttga ccagctagag gatgtgttcc tgctgtggcc caaggccggg 
9 61 aagagcctcg aggtgtacgc gctgttcagc accgtcagtg ccgtgttcca gggcttcgcc 
1021 gtctgtgtgt accacatggc agacatctgg gaggttttca acgggccctt tgcccaccga 
1081 gatgggcctc agcaccagtg ggggccctat gggggcaagg tgcccttccc tcgccctggc 
1141 gtgtgcccca gcaagatgac cgcacagcca ggacggcctt ttggcagcac caaggactac 
12 01 ccagatgagg tgctgcagtt tgcccgagcc caccccctca tgttctggcc tgtgcggcct 
12 61 cgacatggcc gccctgtcct tgtcaagacc cacctggccc agcagctaca ccagatcgtg 
1321 gtggaccgcg tggaggcaga ggatgggacc tacgatgtca ttttcctggg gactgactca 
1381 gggtctgtgc tcaaagtcat cgctctccag gcagggggct cagctgaacc tgaggaagtg 
1441 gttctggagg agctccaggt gtttaaggtg ccaacaccta tcaccgaaat ggagatctct 
1501 gtcaaaaggc aaatgctata cgtgggctct cggctgggtg tggcccagct gcggctgcac 
1561 caatgtgaga cttacggcac tgcctgtgca gagtgctgcc tggcccggga cccatactgt 
1621 gcctgggatg gtgcctcctg tacccactac cgccccagcc ttggcaagcg ccggttccgc 
1681 cggcaggaca tccggcacgg caaccctgcc ctgcagtgcc tgggccagag ccaggaagaa 
1741 gaggcagtgg gacttgtggc agccaccatg gtctacggca cggagcacaa tagcaccttc 
1801 ctggagtgcc tgcccaagtc tccccaggct gctgtgcgct ggctcttgca gaggccaggg 
1861 gatgaggggc ctgaccaggt gaagacggac gagcgagtct tgcacacgga gcgggggctg 
1921 ctgttccgca ggcttagccg tttcgatgcg ggcacctaca cctgcaccac tctggagcat 
1981 ggcttctccc agactgtggt ccgcctggct ctggtggtga ttgtggcctc acagctggac 
2 041 aacctgttcc ctccggagcc aaagccagag gagcccccag cccggggagg cctggcttcc 
2101 accccacoca aggcctggta caaggacatc ctgcagctca ttggcttcgc caacctgccc 
2161 cgggtggatg agtactgtga gcgcgtgtgg tgcaggggca ccacggaatg ctcaggctgc 
2221 ttccggagcc ggagccgggg caagcaggcc aggggcaaga gctgggcagg gctggagcta 
2281 ggcaagaaga tgaagagccg ggtgcatgcc gagcacaatc ggacgccccg ggaggtggag 
2341 gccacgtaga agggggcaga ggaggggtgg tcaggatggg ctggggggcc cactagcagc 
2401 ccccagcatc tcccacccac ccagctaggg cagaggggtc aggatgtctg tttgcctctt 
2461 agagacaggt gtctctgccc ccacaccgct actggggtct aatggagggg ctgggttctt 
2521 gaagcctgtt ccctgccctt ctctgtgctc ttagacccag ctggagccag caccctctgg 
2581 ctgctggcag ccccaaggga tctgccattt gttctcagag atggcctggc ttccgcaaca 
2641 catttccggg tgtgcccaga ggcaagaggg ttgggtggtt ctttcccagc ctacagaaca 
2701 atggccattc tgagtgaccc tcagagtggg tgtgtgggtg cgtctagggg gtatcccggt 
2761 agggggcctg cagggagcca gagggtggaa atggcctcta agctagcacc ccgtaagaag 
2821 agcctacctg accgacttgg ggagggaaca cagaggtgtt gggaaggtgg agcaacaatg 
2881 cacctcccct cctgtcgcgc cgtgatatct tggtggctcc ctgccactgc ccaccgcctc 
2941 ttctccatct gagaatcacg gagaggtgta gataatctag aggcatagac tgctagagcc 
3001 cccagggatc tggggtggtc agggctcagg cttcactttg taaaccaggt gggggcatct 
3061 cacagcctga cttcccttcc ccaggccagg gttgctggga tgcctgcccc tcctgagagg 
3121 accccctccc cattgtcagg ctctccatgt ccacgagcgg ggaggggtgg gttctggggc 
3181 attgttgtcc cttgtgtctg tggactagag atagggtggg ggagctgggg aagggtgcag 
3241 gcgggaagag tgggctgtct ttcccagggt gatgcaagca tgccgcagcc ctggaggctg 
33 01 ggaatgtgga ggctctgtga gccctgcagc cctcagaatc agggccaggg atgcagaaga 
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3361 ttgagaggat atggagatgg atagagggca 
3421 ggcaggaaca ggtgtccaca agaactcagg 
3481 aagacccagt gtttccatct ctggaatctc 
3541 gtttttcttt taagggggaa acaaggtaga 
3 601 attctcgggg gtaaggctcg gatggcaagg 
3661 gtttttgcca tcaccagttt ctcaggctgg 
3721 gaaaagttgt tcccagcctg cacatgaaca 
3781 agataagagc actgggtttg agattccctc 
3841 ggaaaaaaag ataaaaagca agccagggtt 
3901 gccttgggtt ttatctttcc cttacccctg 
3961 tccctccgtt ctcccctttg accatgtaat 
4021 tcaacgccct gagaagcctt ccagcctgcg 
4081 ggcagaggag gagaggagga aaggatgggg 
4141 gggattaaag aggggaggag agagtgcaga 
42 01 agctctgccc tctaccctag ggaggccaga 
4261 ctggactctg gcttggcagg ctccaggcag 
4321 agggaggagg agcacaagat cctcagcaac 
4381 ttctgccaac cacaccctac ccatggtact 
4441 aaatgcgagt tgtttttgta tttgtgtgtt 
4501 gagcacattt cttgtaatta ctattgttat 
45 61 gtgagaaaag ctgaatttac aaggaaaggg 
4621 atcattactg tgtatctgtg tattgtacta 
4681 aatgaagagc cctcccatcc 



ggagaccctt aggatagatt gtgggaccca 
atggcatcag ttagctcaga agccacctgg 
tgttttatgc taaatggatt taggaagact 
gaaaaggacg aagaagtgta agtcccgctg 
acgcgttctg cctgggcatg taggggaggt 
ggagcacaga ggggaggagg aggactaaat 
cattcatgac acacaaaact ggctggaagg 
cattaaaaca accaagacaa agaaaggagg 
ccctgcccta ttgaaactca aacccagact 
gcacctccag agaactggga cctgaaatag 
aaatgaacca gaagcactga gattaaccta 
gtgctgtctg ctgggaggtc agctggtcaa 
gctgaagagc agaagggagg ggagacagag 
gctccaggaa agggtatcag agctgcagcc 
aagacacaaa cagccctccg ggcctttacg 
ggtcctctgg gaagttactc tagaaaacga 
gaacacctgc acttagaaaa agtggacagc 
gtatgctatt aactcctgga aacgccccgt 
gagatgggcc ttgtggtttc tctgtactca 
ttttattgtc atgactgccc ctgagctctg 
atgaagttaa tatttgcatc acataattat 
aatggactga tgctgcgcac atgagctgaa 
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FASTA searches a protein or DNA sequence data bank 
version 3.3t05 March 30, 2000 
Please cite: 

W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 

/tmp/f as taGAALlaqDv: 2 628 nt 

>LEX151 SEQ ID NO:l 

vs /tmp/f astaHAAMlaqDv library 
searching /tmp/f astaHAAMlaqDv library 

4700 residues in 1 sequences 

FASTA (3.-34 January 2000) function [optimized, +5A-4 matrix ( 5 : -4 ) ] ktup : 6 

join: 77, opt: 62, gap-pen: -16/ -4, width: 16 

Scan time: 0.117 
The best scores are: opt 
gi|897820l|dbj |AB029496.1| Homo sapiens mRNA f (4700) [f] 11742 
gi | 8978201 jdbj |AB029496.1| Homo sapiens mRNA f (4700) [r] 95 

»gi | 8978201 |dbj | AB029496 . 1 1 Homo sapiens mRNA for semap (4700 nt) 
initn: 11742 initl: 11742 opt: 11742 

99.957% identity in 2349 nt overlap (280-2628:1-2349) 

250 260 270 280 290 300 

LEX1 5 1 AGGCGGCAGCGGTGCCCTCAGTTCCCCAGCATGGCCCCCTCGGCCTGGGCCATTTGCTGG 



gi | 897 ATGGCCCCCTCGGCCTGGGCCATTTGCTGG 

10 20 30 

310 320 330 340 350 360 

IjEXISI CTGCTAGGGGGCCTCCTGCTCCATGGGGGTAGCTCTGGCCCCAGCCCCGGCCCCAGTGTG 

gi I 897 CTGCTAGGGGGCCTCCTGCTCCATGGGGGTAGCTCTGGCCCCAGCCCCGGCCCCAGTGTG 
40 50 60 70 80 90 

370 380 390 400 410 420 

LEX151 CCCCGCCTGCGGCTCTCCTACCGAGACCTCCTGTCTGCCAACCGCTCTGCCATCTTTCTG 

gi | 897 CCCCGCCTGCGGCTCTCCTACCGAGACCTCCTGTCTGCCAACCGCTCTGCCATCTTTCTG* 
100 - 110 120 130 140 150 

430 440 450 460 470 480 

~LEXr51~GGCCCCCAGGGCTCCCTGAACCTCCAGGCCATGTACCTAGATGAGTACCGAGACCGCCTC- 

gi | 897 GGCCCCCAGGGCTCCCTGAACCTCCAGGCCATGTACCTAGATGAGTACCGAGACCGCCTC 
160 170 180 190 200 210 

490 500 510 520 530 540 

LEX1 5 1 TTTCTGGGTGGCCTGGACGCCCTCTACTCTCTGCGGCTGGACCAGGCATGGCCAGATCCC 

gi | 897 TTTCTGGGTGGCCTGGACGCCCTCTACTCTCTGCGGCTGGACCAGGCATGGCCAGATCCC 
220 230 240 250 260 270 

550 560 570 580 590 600 

LEXl 5 1 CGGGAGGTCCTGTGGCCACCGCAGCCAGGACAGAGGGAGGAGTGTGTTCGAAAGGGAAGA 

gi | 897 CGGGAGGTCCTGTGGCCACCGCAGCCAGGACAGAGGGAGGAGTGTGTTCGAAAGGGAAGA 
280 290 300 310 320 330 

610 620 630 640 650 660 
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LEXl 5 1 GATCCTTTGACAGAGTGCGCCAACTTCGTGCGGGTGCTACAGCCTCACAACCGGACCCAC 



gi | 897 GATCCTTTGACAGAGTGCGCCAACTTCGTGCGGGTGCTACAGCCTCACAACCGGACCCAC 
340 350 360 370 380 390 

670 680 690 700 710 720 

LEXl 5 1 CTGCTAGCCTGTGGCACTGGGGCCTTCCAGCCCACCTGTGCCCTCATCACAGTTGGCCAC 



gi | 897 CTGCTAGCCTGTGGCACTGGGGCCTTCCAGCCCACCTGTGCCCTCATCACAGTTGGCCAC 
400 410 420 430 440 450 

730 740 750 760 770 780 

LEX151 CGTGGGGAGCATGTGCTCCACCTGGAGCCTGGCAGTGTGGAAAGTGGCCGGGGGCGGTGC 



gi | 897 CGTGGGGAGCATGTGCTCCACCTGGAGCCTGGCAGTGTGGAAAGTGGCCGGGGGCGGTGC 
460 470 480 490 500 510 

790 800 810 820 830 840 

LEX151 CCTCACGAGCCCAGCCGTCCCTTTGCCAGCACCTTCATAGACGGGGAGCTGTACACGGGT 



gi | 897 CCTCACGAGCCCAGCCGTCCCTTTGCCAGCACCTTCATAGACGGGGAGCTGTACACGGGT 
520 530 540 550 560 570 

850 860 870 880 890 900 

LEXl 5 1 CTCACTGCTGACTTCCTGGGGCGAGAGGCCATGATCTTCCGAAGTGGAGGTCCTCGGCCA 



gi | 897 CTCACTGCTGACTTCCTGGGGCGAGAGGCCATGATCTTCCGAAGTGGAGGTCCTCGGCCA 
580 590 600 610 620 630 

910 920 930 940 950 960 

LEX151 GCTCTGCGTTCCGACTCTGACCAGAGTCTCTTGCACGACCCCCGGTTTGTGATGGCCGCC 



gi | 897 GCTCTGCGTTCCGACTCTGACCAGAGTCTCTTGCACGACCCCCGGTTTGTGATGGCCGCC 
640 650 660 670 680 690 

970 980 990 1000 1010 1020 

LEXl 5 1 CGGATCCC TGAGAAC TC TGACC AGGAC AATGAC AAGGTGTAC TTC TTCTTCTC GGAGAC G 



gi | 897 CGGATCCCTGAGAACTCTGACCAGGACAATGACAAGGTGTACTTCTTCTTCTCGGAGACG 
700- 710 720 730 740 750 

1030 1040 1050 1060 1070 1080 

LEXl 5 1 GTCCCCTCGCCCGATGGTGGCTCGAACCATGTCACTGTCAGCCGCGTGGGCCGCGTCTGC 



gi | 897 GTCCCCTCGCCCGATGGTGGCTCGAACCATGTCACTGTCAGCCGCGTGGGCCGCGTCTGC 
760 770 780 790 800 810 

1090 1100 1110 1120 1130 1140 

LEXl 5 1 GTGAATGATGCTGGGGGCCAGCGGGTGCTGGTGAACAAATGGAGCACTTTCCTCAAGGCC 



gi | 897 GTGAATG ATGC TGGGGGCC AGCGGGTGC TGGTG AAC AAATGG AGC AC TTTC C TC AAGGCC 
820 830 840 850 860 870 

1150 1160 1170 1180 1190 1200 

LEXl 51 AGGCTGGTCTGCTCGGTGCCCGGCCCTGGTGGTGCCGAGACCCACTTTGACCAGCTAGAG 



gi | 897 AGGCTGGTCTGCTCGGTGCCCGGCCCTGGTGGTGCCGAGACCCACTTTGACCAGCTAGAG 
880 890 900 910 920 930 

1210 1220 1230 1240 1250 1260 
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gi | 897 GATGTGTTCCTGCTGTGGCCCAAGGCCGGGAAGAGCCTCGAGGTGTACGCGCTGTTCAGC 
940 950 960 970 980 990 

1270 1280 1290 1300 1310 1320 

LEXl 5 1 ACCGTCAGTGCCGTGTTCCAGGGCTTCGCCGTCTGTGTGTACCACATGGCAGACATCTGG 



gi | 897 ACCGTCAGTGCCGTGTTCCAGGGCTTCGCCGTCTGTGTGTACCACATGGCAGACATCTGG 
1000 1010 1020 1030 1040 1050 

1330 1340 1350 1360 1370 1380 

LEXl 5 1 GAGGTTTTCAACGGGCCCTTTGCCCACCGAGATGGGCCTCAGCACCAGTGGGGGCCCTAT 



gi | 897 GAGGTTTTCAACGGGCCCTTTGCCCACCGAGATGGGCCTCAGCACCAGTGGGGGCCCTAT 
1060 1070 1080 1090 1100 1110 

1390 1400 1410 1420 1430 1440 

LEXl 5 1 GGGGGCAAGGTGCCCTTCCCTCGCCCTGGCGTGTGCCCCAGCAAGATGACCGCACAGCCA 



gi | 897 GGGGGCAAGGTGCCCTTCCCTCGCCCTGGCGTGTGCCCCAGCAAGATGACCGCACAGCCA 
1120 1130 1140 1150 1160 1170 

1450 1460 1470 1480 1490 1500 

LEXl 5 1 GGACGGCCTTTTGGCAGCACCAAGGACTACCCAGATGAGGTGCTGCAGTTTGCCCGAGCC 



gi | 897 GGACGGCCTTTTGGCAGCACCAAGGACTACCCAGATGAGGTGCTGCAGTTTGCCCGAGCC 
1180 1190 1200 1210 1220 1230 

1510 1520 1530 1540 1550 1560 

LEXl 51 CACCCCCTCATGTTCTGGCCTGTGCGGCCTCGACATGGCCGCCCTGTCCTTGTCAAGACC 

gi | 897 CACCCCCTCATGTTCTGGCCTGTGCGGCCTCGACATGGCCGCCCTGTCCTTGTCAAGACC 
1240 1250 1260 1270 1280 1290 

1570 1580 1590 1600 1610 1620 

LEX151 CACCTGGCCCAGCAGCTACACCAGATCGTGGTGGACCGCGTGGAGGCAGAGGATGGGACC 

gi | 897 CACCTGGCCCAGCAGCTACACCAGATCGTGGTGGACCGCGTGGAGGCAGAGGATGGGACC 
1300. 1310 1320 1330 1340 1350 

1630 1640 1650 1660 1670 1680 

~LEXr51~TACGATGTCATTTTCCTGGGGACT 

gi | 897 T AC G ATGTC ATTTTC CTGGGGAC TG AC TC AGGGTC TGTGC TC AAAGTC ATCGC TC TC C AG 
1360 1370 1380 1390 1400 1410 

1690 1700 1710 1720 1730 1740 

LEXl 5 1 GC AGGGGGCTC AGCTG AAC C TG AGGAAGTGGTTC TGG AGGAGC TC C AGGTGTTTAAGGTG 

gi | 897 GCAGGGGGCTCAGCTGAACCTGAGGAAGTGGTTCTGGAGGAGCTCC AGGTGTTTAAGGTG 
1420 1430 1440 1450 1460 1470 

1750 1760 1770 1780 1790 1800 

LEXl 5 1 CCAACACCTATCACCGAAATGGAGATCTCTGTCAAAAGGCAAATGCTATACGTGGGCTCT 

gi | 897 C C AAC AC C T ATC AC C GAAATGG AG ATC TC TGTC AAAAGGC AAATGC TATACGTGGGC TC T 
1480 1490 1500 1510 1520 153 0 

1810 1820 1830 1840 1850 1860 
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LEXl 5 1 CGGCTGGGTGTGGCCCAGCTGCGGCTGCACCAATGTGAGACTTACGGCACTGCCTGTGCA 

gi | 897 CGGCTGGGTGTGGCCCAGCTGCGGCTGCACCAATGTGAGACTTACGGCACTGCCTGTGCA 
1540 1550 1560 1570 1580 1590 

1870 1880 1890 1900 1910 1920 

LEXl 5 1 GAGTGCTGCCTGGCCCGGGACCCATACTGTGCCTGGGATGGTGCCTCCTGTACCCACTAC 

gi | 897 GAGTGCTGCCTGGCCCGGGACCCATACTGTGCCTGGGATGGTGCCTCCTGTACCCACTAC 
1600 1610 1620 1630 1640 1650 

1930 1940 1950 1960 1970 1980 

LEX 15 1 CGCCCCAGCCTTGGCAAGCGCCGGTTCCGCCGGCAGGACATCCGGCACGGCAACCCTGCC 

gi | 897 CGCCCCAGCCTTGGCAAGCGCCGGTTCCGCCGGCAGGACATCCGGCACGGCAACCCTGCC 
1660 1670 1680 1690 1700 1710 

1990 2000 2010 2020 2030 2040 

LEXl 5 1 CTGCAGTGCCTGGGCCAGAGCCAGGAAGAAGAGGCAGTGGGACTTGTGGCAGCCACCATG 

gi | 897 CTGCAGTGCCTGGGCCAGAGCCAGGAAGAAGAGGCAGTGGGACTTGTGGCAGCCACCATG 
1720 1730 1740 1750 1760 1770 

2050 2060 2070 2080 2090 2100 

LEXl 5 1 GTCTACGGCACGGAGCACAATAGCACCTTCCTGGAGTGCCTGCCCAAGTCTCCCCARGCT 
—::::::::::::::::::::::::::::::::::::::::::::::::::::::.::: 
gi | 897 GTCTACGGCACGGAGCACAATAGCACCTTCCTGGAGTGCCTGCCCAAGTCTCCCCAGGCT 
1780 1790 1800 1810 1820 1830 

2110 2120 2130 2140 2150 2160 

LEXl 5 1 GCTGTGCGCTGGCTCTTGCAGAGGCCAGGGGATGAGGGGCCTGACCAGGTGAAGACGGAC 

gi | 897 GC TGTGC GC TGGC TCTTGC AGAGGCC AGGGGATGAGGGGCCTG AC C AGGTG AAGAC GGAC 
1840 1850 1860 1870 1880 1890 

2170 2180 2190 2200 2210 2220 

LEXl 5 1 GAGCGAGTCTTGCACACGGAGCGGGGGCTGCTGTTCCGCAGGCTTAGCCGTTTCGATGCG 

gi | 897 GAGCGAGTCTTGCACACGGAGCGGGGGCTGCTGTTCCGCAGGCTTAGCCGTTTCGATGCG 
1900 . 1910 1920 1930 1940 1950 

2230 2240 2250 2260 2270 2280 

-LEX-l-5-l—GGGAGGTAGAGGTGG"AGGAGTeTGGAGCATGGCTTCTCCCAGACTGTGGTCCGCCTGGCT — 

gi | 897 GGCACCTACACCTGCACCACTCTGGAGCATGGCTTCTCCCAGACTGTGGTCCGCCTGGCT 
1960 1970 1980 1990 2000 2010 

2290 2300 2310 2320 2330 2340 

LEXl 5 1 CTGGTGGTGATTGTGGCCTC ACAGCTGGACAACCTGTTCCCTCCGGAGCCAAAGCCAGAG 

gi | 897 CTGGTGGTGATTGTGGCCTCACAGCTGGACAACCTGTTCCCTCCGGAGCCAAAGCCAGAG 
2020 2030 2040 2050 2060 2070 

2350 2360 2370 2380 2390 2400 

LEXl 5 1 GAGCCCCCAGCCCGGGGAGGCCTGGCTTCCACCCCACCCAAGGCCTGGTACAAGGACATC 

gi | 897 GAGCCCCCAGCCCGGGGAGGCCTGGCTTCCACCCCACCCAAGGCCTGGTACAAGGACATC 
2080 2090 2100 2110 2120 2130 

2410 2420 2430 2440 2450 2460 
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gi | 897 CTGCAGCTCATTGGCTTCGCCAACCTGCCCCGGGTGGATGAGTACTGTGAGCGCGTGTGG 
2140 2150 2160 2170 2180 2190 

2470 2480 2490 2500 2510 2520 

LEXl 5 1 TGCAGGGGCACCACGGAATGCTCAGGCTGCTTCCGGAGCCGGAGCCGGGGCAAGCAGGCC 



gi | 897 TGCAGGGGCACCACGGAATGCTCAGGCTGCTTCCGGAGCCGGAGCCGGGGCAAGCAGGCC 
2200 2210 2220 2230 2240 2250 

2530 2540 .2550 2560 2570. 2580 

LEXl 5 1 AGGGGCAAGAGCTGGGCAGGGCTGGAGCTAGGCAAGAAGATGAAGAGCCGGGTGCATGCC 



gi | 897 AGGGGCAAGAGCTGGGCAGGGCTGGAGCTAGGCAAGAAGATGAAGAGCCGGGTGCATGCC 
2260 2270 2280 2290 2300 2310 

2590 2600 2610 2620 

LEXl 5 1 GAGCAC AATCGGACGCCCCGGGAGGTGGAGGCCACGTAG 



g i | 8 9 7 GAGCAC AATC GGAC GCC CC GGGAGGTGGAGGCC AC GT AGAAGGGGGC AGAGG AGGGGTGG 
2320 2330 2340 2350 2360 2370 

gi | 897 TCAGGATGGGCTGGGGGGCCCACTAGCAGCCCCCAGCATCTCCCACCCACCCAGCTAGGG 
2380 2390 2400 2410 2420 2430 

»gi | 8978201 |dbj |AB029496 . 1 | Homo sapiens mRNA for semap (4700 nt) 
rev-comp initn: 83 initl : 83 opt: 95 
67.105% identity in 76 nt overlap (119-46:407-475) 

150 140 130 120 110 100 

LEXl 5 - GCGGGAAGAGGGGCGGAGGAGAGAAGGAGGCTGGGGCCTTGCCGTCCACCTGCCGCTTCT 

gi | 8 9 7 ACAACCGGACCCACCTGCTAGCCTGTGGCACTGGGGCCTTCCAGCCCACCTGTGCC - -CT 
380 390 400 410 420 430 

90 80 70 60 50 40 

LEXl 5- CCTTCCACCTTGTTGGCC-CAGTGCAG-GCTTTTGTGCCACACTGGCCAGCTCCCCATTG 



gi | 897 CATCACA GTTGGCCACCGTGGGGAGCATGTGCTCCAC-CTGGAGCCTGGCAGTGTG 

440 450 460 470 480 

3 0 2 0 10 

LEXl 5 - GGAAGACCTTCCCAGCTAGGGCACAGGCCAT 

gi | 897 GAAAGTGGCC GGGGGC GGTGC CC TC AC GAGCCC AGCC GTC CC TTTGCC AGC AC C TTC ATA 
490 500 510 520 530 540 



2628 residues in 1 query sequences 
4700 residues in 1 library sequences 
Scomplib [version 3.3t05 March 30, 2000] 

start: Fri Sep 19 13:50:44 2003 done: Fri Sep 19 13:50:45 2003 
Scan time: 0.117 Display time: 0.133 

Function used was FASTA 
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Paracel BLAST Results 



MEGABLAST 1 . 2 . 3-Paracel [2001-11-20] 
Reference ; 

Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2 000) , 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7 ( 1-2 ): 203-14 . 
Database : Homo_sapiens . latestgp . fa 

26,679 sequences; 200,800,637,119 total letters 

Query= 1 

(2629 letters) 



Score E 

Sequences producing significant alignments: (bits) Value 

AC006208. 3. 1.123943 940 0.0 

AC000063 .1.1.34478 ' 72 2e-09 

AC079799. 7. 1.172495 54 5e-04 



>AC006208. 3 .1.123943 

Length = 123943 



Score - 940 bits (474), Expect = 0.0 
Identities =474/474 (100%) 
Strand = Plus / Minus 



Query: 2156 caggtgaagacggacgagcgagtcttgcacacggagcgggggctgctgttccgcaggctt 2215 

IIIMIIIIIIIIIMIIII I II MINI llllll I MINI IMIIIIIIIII I Mill 

Sbjct: 44516 caggtgaagacggacgagcgagtcttgcacacggagcgggggctgctgttccgcaggctt 44457 
Query: 2216 agccgtttcgatgcgggcacctacacctgcaccactctggagcatggcttctcccagact 2275 

IMIIMIIIIIIII lllllll I Mill MINIMI MM I llllllll MM MINI 

Sbjct: 44456 agccgtttcgatgcgggcacctacacctgcaccactctggagcatggcttctcccagact 44397 
Query: 227 6 gtggtccgcctggctctggtggtgattgtggcctcacagctggacaacctgttccctccg 2335 

iiriTmrim^ 

Sbjct: 44396 gtggtccgcctggctctggtggtgattgtggcctcacagctggacaacctgttccctccg 44337 



Query: 233 6 gagccaaagccagaggagcccccagcccggggaggcctggcttccaccccacccaaggcc 23 95 

M II 1 1 1 II I M II II M M II 1 1 II I II II 1 1 M II I II II 1 1 1 1 1 M I II 1 1 I II II I 

Sbjct : 44336. gagccaaagccagaggagcccccagcccggggaggcctggcttccaccccacccaaggcc 44277 
Query: 2396 tggtacaaggacatcctgcagctcattggcttcgccaacctgccccgggtggatgagtac 2455 

1 1 1 M 1 1 f I f 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 1 f i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 M I 

Sbjct : 44276 tggtacaaggacatcctgcagctcattggcttcgccaacctgccccgggtggatgagtac 44217 



Query: 2456 tgtgagcgcgtgtggtgcaggggcaccacggaatgctcaggctgcttccggagccggagc 2515 

Ml i II INN II I MINI M 1 1 III I II Ml Ml I ill I I II II Mil M I I M M I I 

Sbjct : 44216 tgtgagcgcgtgtggtgcaggggcaccacggaatgctcaggctgcttccggagccggagc 44157 
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Query: 2516 cggggcaagcaggccaggggcaagagctgggcagggctggagctaggcaagaagatgaag 2575 

Mlllllilllll illlllllllMII! IIIIIMIMIII II! MM II III MINN 

Sbjct: 44156 cggggcaagcaggccaggggcaagagctgggcagggctggagctaggcaagaagatgaag 44097 
Query: 2576 agccgggtgcatgccgagcacaatcggacgccccgggaggtggaggccacgtag 2629 

Mil 1 1 1 II I i II 1 1 1 1 1 1 1 1 ! 1 1 II 1 1 1 1 i 1 1 Mill I i 1 1 1 1 1 1 1 1 i i 1 1 1 1 

Sbjct: 44096 agccgggtgcatgccgagcacaatcggacgccccgggaggtggaggccacgtag 44043 



Score = 781 bits (394), Expect =0.0 
Identities = 394/394 (100%) 
Strand = Plus / Minus 



Query : 2 atggcctgtgccc tagctgggaaggtcttcccaatggggagctggccagtgtggcacaaa 61 

1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ! I M 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

Sbjct: 53746 atggcctgtgccctagctgggaaggtcttcccaatggggagctggccagtgtggcacaaa 53687 
Query: 62 agcctgcactgggccaacaaggtggaaggagaagcggcaggtggacggcaaggccccagc 121 

1 1 1 1 1 1 1 1 1 1 1 ! II I i 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 ! i I II 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 

Sbjct: 53686 agcctgcactgggccaacaaggtggaaggagaagcggcaggtggacggcaaggccccagc 53627 
Query: 122 ctccttctctcctccgcccctcttcccgcccaggactgggtggagccactgccttataag 181 

. 1 1 II MM I II I II I II I II III I INI II Mill III II II II I II I MM I MM I II 

Sbjct: 53626 ctccttctctcctccgcccctcttcccgcccaggactgggtggagccactgccttataag 53567 
Query: 182 tggtggcctggtggcagcagagcaaactacaaccggcggccagcgggaccagagggcggc 241 

IMIIMIIIIIMI Ml II IIIIIIMIIIIIIIIIIIIII II MM MM II IMIII 

, Sbjct: 53566 tggtggcctggtggcagcagagcaaactacaaccggcggccagcgggaccagagggcggc 53507 
Query: 242 tctgcaggcaggcggcagcggtgccctcagttccccagcatggccccctcggcctgggcc 3 01 

IMIIIIIIMIII Ml IIMIIIIMI llllllllllll MMIIIIIMI M MUM 

Sbjct : 53506 tctgcaggcaggcggcagcggtgccctcagttccccagcatggccccctcggcctgggcc 53447 



Query: 302 



Sbjct: 



361 



atttgctggctgctagggggcctcctgctccatgggggtagctctggccccagccccggc 

IMIII 1 1 IMIMI Ml IMIIIIIMIII IMIMIII MM Ml IMIIM I Mill 

53446 atttgctggctgctagggggcctcctgctccatgggggtagctctggccccagccccggc 53387 



Query: 362 cccagtgtgccccgcctgcggctctcctaccgag 395 

Mill IIMIIIIMI 1 1 III 1 1 IMIII II I II 

Sbjct: 53386 cccagtgtgccccgcctgcggctctcctaccgag 53353 



Score = 462 bits (233), Expect = e-127 
Identities = 233/233 (100%) 
Strand = Plus / Minus 



Query: 1423 gtgccccagcaagatgaccgcacagccaggacggccttttggcagcaccaaggactaccc 1482 

IIIIIIIIMIIIIIII IMMMIIIIIIIIIIIMIIMI Mill MIIIMIIIMI 
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Sbjct: 48539 gtgccccagcaagatgaccgcacagccaggacggccttttggcagcaccaaggactaccc 48480 
Query: 1483 agatgaggtgctgcagtttgcccgagcccaccccctcatgttctggcctgtgcggcctcg 1542 

III! II Ml IIIMIIIIIII1IIIIII IMIIIIIIIII 1 1 1 II III IMIIIIMIII 

Sbjct : 48479 agatgaggtgctgcagtttgcccgagcccaccccctcatgttctggcctgtgcggcctcg 48420 
Query: 1543 acatggccgccctgtccttgtcaagacccacctggcccagcagctacaccagatcgtggt 1602 

IMIIMII IMIIIIMIIMI Mill MIIIIIIIIMI lllllllllllllllllll 

Sbjct: 48419 acatggccgccctgtccttgtcaagacccacctggcccagcagctacaccagatcgtggt 48360 
Query: 1603 ggaccgcgtggaggcagaggatgggacctacgatgtcattttcctggggactg 1655 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 f 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 

Sbjct : 48359 ggaccgcgtggaggcagaggatgggacctacgatgtcattttcctggggactg 48307 



Score = 456 bits (230) / Expect = e-125 
Identities = 230/230 (100%) 
Strand = Plus / Minus 



Query: 1789 gcaaatgctatacgtgggctctcggctgggtgtggcccagctgcggctgcaccaatgtga 1848 

MM II lllllll IIIIIIIMIMIII MIMMMIIIIII II MMMI llllllll 

Sbjct: 46640 gcaaatgctatacgtgggctctcggctgggtgtggcccagctgcggctgcaccaatgtga 46581 

Query: 1849 gacttacggcactgcctgtgcagagtgctgcctggcccgggacccatactgtgcctggga 1908 

I M 1 1 1 1 1 1 f 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 E 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

Sbjct: 46580 gacttacggcactgcctgtgcagagtgctgcctggcccgggacccatactgtgcctggga 46521 



Query : 
Sbjct : 



1909 



46520 



tggtgcctcctgtacccactaccgccccagccttggcaagcgccggttccgccggcagga 1968 

1 1 MINI 1 1 MM MM 1 1 II I III Mill MINIM MINI I II I III MINIM 

tggtgcctcctgtacccactaccgccccagccttggcaagcgccggttccgccggcagga 46461 



Query: 1969 catccggcacggcaaccctgccctgcagtgcctgggccagagccaggaag 2 018 

MMM MM III Mill MIMMIMMIIIIIIMI I MIMIIIII 

Sbjct: 46460 catccggcacgg c aaccctgccctgcagtgcctgggccagagccaggaag 464 11 



Score = 327 bits (165), Expect = 2e-86 
Identities = 165/165 (100%) 
Strand = Plus / Minus 



Query: 394 agacctcctgtctgccaaccgctctgccatctttctgggcccccagggctccctgaacct 453 

M i N i u ii ii i ii i ii i ii ii ii 1 1 1 1 ii i ii ii ii 1 1 1 1 1 1 1 1 1 ii ii i M i i ii ii 

Sbjct : 51349 agacctcc tgtctgccaaccgc tctgccatctttctgggcccccagggctccctgaacct 51290 



Query: 454 ccaggccatgtacctagatgagtaccgagaccgcctctttctgggtggcctggacgccct 513 

1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r 1 1 r 1 1 1 1 1 f 1 1 1 1 f 1 1 1 1 j 1 1 i i f 1 1 f r 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 51289 ccaggccatgtacctagatgagtaccgagaccgcctctttctgggtggcctggacgccct 51230 
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Query: 514 



Sbjct : 



ctactctctgcggctggaccaggcatggccagatccccgggaggt 558 

I 1 1 1 1 1 1 1 1 i I II 1 1 i 1 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 I I M ii 1 1 1 1 1 1 

51229 ctactctctgcggctggaccaggcatggccagatccccgggaggt 51185 



Score = 294 bits (148), Expect = 3e-76 
Identities = 148/148 (100%) 
Strand = Plus / Minus 

Query: 127 6 cagtgccgtgttccagggcttcgccgtctgtgtgtaccacatggcagacatctgggaggt 1335 

1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Sbjct: 48964 cagtgccgtgttccagggcttcgccgtctgtgtgtaccacatggcagacatctgggaggt 48905 

Query: 1336 tttcaacgggccctttgcccaccgagatgggcctcagcaccagtgggggccctatggggg 13 95 

I III II III II Mill III Ml I II MM MM I II IIIMIIIII llllll IIMIIII 

Sbjct: 48904 tttcaacgggccctttgcccaccgagatgggcctcagcaccagtgggggccctatggggg 48845 



Query: 1396 caaggtgcccttccctcgccctggcgtg 1423 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 48844 caaggtgcccttccctcgccctggcgtg 48817 



Score = 292 bits (147), Expect = le-75 
Identities = 147/147 (100%) 
Strand = Plus / Minus 



Query: 947 gacccccggtttgtgatggccgcccggatccctgagaactctgaccaggacaatgacaag 1006 

1 1 1 1 M E 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 it II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

Sbjct: 49850 gacccccggt ttgtgatggccgcccggatccctgagaactctgaccaggacaatgacaag 49791 
Query: 1007 gtgtacttcttcttctcggagacggtcccctcgcccgatggtggctcgaaccatgtcact 1066 

1 1 Ml II Ml Ml II I Ml Ml II I Ml Mill Ml II MM MUM MIIMI Ml II 

Sbjct: 49790 gtgtacttcttcttctcggagacggtcccctcgcccgatggtggctcgaaccatgtcact 49731 



Query: 1067 gtcagccgcgtgggccgcgtctgcgtg 1093 

I Ml II MIIMI Mill I II 1 1 II II 

Sbjct: 49730 gtcagccgcgtgggccgcgtctgcgtg 49704 



Score = 286 bits (144), Expect = 6e-74 
Identities = 145/146 (99%) 
Strand - Plus / Minus 



Query: 2 017 agaagaggcagtgggacttgtggcagccaccatggtctacggcacggagcacaatagcac 207 6 

I I II I 1 1 II I I 1 1 1 1 1 1 I II I I M I II II M I I I II II II I I II II I M II I M 1 1 II II 

Sbjct : 46108 agaagaggcagtgggacttgtggcagccaccatggtctacggcacggagcacaatagcac 46049 
Query: 2077 cttcctggagtgcctgcccaagtctccccargctgctgtgcgctggctcttgcagaggcc 213 6 

I ! { I I I i r I j 1 1 1 1 ! I I 1 1 I [ I ! I I 1 1 1 1 1 Mill MM MIIMI MUM Ml MM 
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Sbjct: 46048 cttcctggagtgcctgcccaagtctccccaggctgctgtgcgctggctcttgcagaggcc 45989 
Query: 2137 aggggatgaggggcctgaccaggtga 2162 

llllllllllllllllllllll MM 

Sbjct: 45988 aggggatgaggggcctgaccaggtga 45963 



Score = 240 bits (121), Expect = 3e-60 
Identities = 121/121 (100%) 
Strand = Plus / Minus 

Query: 619 gacagagtgcgccaacttcgtgcgggtgctacagcctcacaaccggacccacctgctagc 678 

MMMMMMMMMMM MMM MMMMMMMMIMMMMMMMI 

Sbjct: 50745 gacagagtgcgccaacttcgtgcgggtgctacagcctcacaaccggacccacctgctagc 50686 
Query: 679 ctgtggcactggggccttccagcccacctgtgccctcatcacagttggccaccgtgggga 738 

M 1 1 1 1 1 1 1 m f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 I i 1 1 1 M I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 50685 ctgtggcactggggccttccagcccacctgtgccctcatcacagttggccaccgtgggga 50626 

Query: 739 g 739 
I 

Sbjct: 50625 g 50625 



Score = 236 bits (119), Expect = 5e-59 
Identities = 119/119 (100%) 
Strand = Plus / Minus 

Query: 82 9 agacggggagctgtacacgggtctcactgctgacttcctggggcgagaggccatgatctt 888 

1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F I f 1 1 1 1 1 1 1 

Sbjct: 50132 agacggggagctgtacacgggtctcactgctgacttcctggggcgagaggccatgatctt 50073 
Query: 889 ccgaagtggaggtcctcggccagctctgcgttccgactctgaccagagtctcttgcacg 947 

JJJJJJJJJJJJJJJJJJJJJJJJJJ-LLLLLLLLLLLLLLLLLLLIJJJJJ-LLLLLLL 



Sbjct : 50072 ccgaagtggaggtcctcggccagctctgcgt tccgactctgaccagagtc tcttgcacg 50014 



Score = 230 bits (116), Expect = 3e-57 
Identities = 116/116 (100%) 
Strand = Plus / Minus 

Query: 1093 gaatgatgctgggggccagcgggtgctggtgaacaaatggagcactttcctcaaggccag 1152 

1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Sbjct : 49489 gaatgatgctgggggccagcgggtgctggtgaacaaatggagcactttcctcaaggccag 49430 
Query: 1153 gctggtctgctcggtgcccggccctggtggtgccgagacccactttgaccagctag 12 08 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I. I I I I 1 I I I I E I I 

Sbjct: 49429 gctggtctgctcggtgcccggccctggtggtgccgagacccactttgaccagctag 49374 
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Score = 188 bits (95), Expect = le-44 
Identities = 95/95 (100%) 
Strand = Plus / Minus 

Query: 1655 gactcagggtctgtgctcaaagtcatcgctctccaggcagggggctcagctgaacctgag 1714 

IIIIIIMIIIIIIMIIIIIIII lllllllilllllll II Ml Mill Illli Mill I 

Sbjct: 48212 gactcagggtctgtgctcaaagtcatcgctctccaggcagggggctcagctgaacctgag 48153 
Query: 1715 gaagtggttctggaggagctccaggtgtttaaggt 1749 

1 1 1 1 1 M III 1 1 1 II MM! I II I II 1 1 1 1 1 1 1 1 1 

Sbjct: 48152 gaagtggttctggaggagctccaggtgtttaaggt 48118 



Score = 184 bits (93), Expect = 2e-43 
Identities = 93/93 (100%) 
Strand = Plus / Minus 

Query: 738 agcatgtgctccacctggagcctggcagtgtggaaagtggccgggggcggtgccctcacg 797 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Sbjct : 50351 agcatgtgctccacctggagcctggcagtgtggaaagtggccgggggcggtgccctcacg 50292 
Query: 798 agcccagccgtccctttgccagcaccttcatag 830 

M II IMIII I II II III II II I Mill Ml II 

Sbjct: 50291 agcccagccgtccctttgccagcaccttcatag 50259 



Score = 143 bits (72), Expect = 6e-31 
Identities = 72/72 (100%) 
Strand = Plus / Minus 

Query: 1207 agaggatgtgttcctgctgtggcccaaggccgggaagagcctcgaggtgtacgcgctgtt 12 66 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 f i i 1 1 1 f 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 

Sbjct: 49265 agaggatgtgttcctgctgtggcccaaggccgggaagagcctcgaggtgtacgcgctgtt 49206 



Query: 1267 cagcaccgtcag 1278 

1 1 M 1 1 1 1 1 1 1 1 

Sbjct: 49205 cagcaccgtcag 49194 



Score = 129 bits (65), Expect = 9e-27 
Identities = 65/65 (100%) 
Strand = Plus / Minus 



Query: 555 aggtcctgtggccaccgcagccaggacagagggaggagtgtgttcgaaagggaagagatc 614 

MIIIIIIIIIIIIIMII MUM lllllllllllllllll IIIIMM MM M MM 

Sbjct : 51063 aggtcctgtggccaccgcagccaggacagagggaggagtgtgttcgaaagggaagagatc 51004 
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Query: 615 ctttg 619 
INN 

Sbjct: 51003 ctttg 50999 



Score = 87.8 bits (44), Expect = 3e-14 
Identities = 44/44 (100%) 
Strand = Plus / Minus 



Query: 1746 aggtgccaacacctatcaccgaaatggagatctctgtcaaaagg 1789 
Sbict- 47403 '''''''''''UIMirillllllllMIIIIIIIIIIIMII 
SJDjct. 47403 aggtgccaacacctatcaccgaaatggagatctctgtcaaaagg 47360 



>AC000063 .1.1.34478 

Length = 34478 

Score =71.9 bits (36), Expect = 2e-09 
Identities = 48/52 (92%) 
Strand = Plus / Minus 



Query: 1860 ctgcctgtgcagagtgctgcctggcccgggacccatactgtgcctgggatgg 1911 

sbict ,711 'I 111 1 11 11 ""MM MMMMMI MMMMMMIMM 

Sbjct: 5711 ctgcctgtgctgactgctgccttgcccgggaccct^ 5660 



>AC079799. 7. 1.172495 

Length = 172495 

Score = 54.0 bits (27), Expect 
Identities = 42/47 (89%) 
Strand = Plus / Minus 



= 5e-04 



Query: 1865 tgtgcagagtgctgcctggcccgggacccatactgtgcctgggatgg 1911 

sbjct- 151014 U U't" """""' " "'" IIMMMMIMMM 

Sbact. 151014 tgtgctgactgctgcctggctcgagacccttactgtgcctgggatgg 150968 



Database : Homo_sapiens . latestgp . fa 

Posted date: Jul 8, 2003 12:51 PM 
Number of letters in database: 200,800,637,119 
Number of sequences in database: 26,679 

Lambda k h 

1.37 0.711 1.31 

Gapped 

Lambda K H 

1.37 0.711 1.31 



Matrix: blastn matrix: 1 -3 

Gap Penalties: Existence: 0, Extension: 0 
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Number of Hits to DB: 0 

length of query: 5260 

length of database: 200,800,637,119 

effective HSP length: 22 

effective length of query: 2607 

effective search space used: 0 

T: 0 

A: 0 

XI: 0 ( 0.0 bits) 
X2: 20 (39.7 bits) 
SI: 12 (24.3 bits) 
S2: 24 (48.. 1 bits) 
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